AMENDMENTS TO THE CLAIMS 



Claims 1-4. (Canceled) 

5. (Currently Amended) A videoconferencing system comprising: 

i 

a conference bridge for interconnecting a plurality of remotely located 
videoconferencing stations; and 

a speaker identification subsystem to determine means for determining whether a 
conferee is speaking based, at least in part, on whether visual lip movements of said 
conferee are reasonably consistent with an audio signal from a conference station in 
which said conferee is locate d, the subsystem also to determine which of the conferees is 
speaking the loudest when multiple conferees are speaking simultaneously , 

6. (Currently Amended) The videoconference system of claim 5 wherein said m e ans 
for determining whether said conf e r ee is sp e aking the speaker identification subsystem 
comprises a voice activity detector. 

7. (Canceled) 

8. (Canceled) 

9. (Previously Presented) The videoconference system of claim 6 wherein said voice 
activity detector includes image analysis and recognition software. 
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10. (Currently Amended) The videoconference system of claim 29 wherein said 
means for visually altering said image comprises means for highlighting a border around 
said image of said conferee determined to be sp e aking the loudest speaker. 

1 1 . (Currently Amended) A videoconference station comprising: 

a transmitter to transmit a combined audio video signal to a videoconference 
bridge; and 

means for determining wh e th e r a confer e e a speaker identification subsystem 
located at said videoconference station to determine whether the conferee at said 
videoconference station is speaking by analyzing whether visual lip movements of said 
conferee are substantially consistent with an audio signal at said station so as to indicate 
human speech , the subsystem also to determine which of the conferees is the loudest 
speaker when multiple conferees are speaking simultaneously . 

12. (Currently Amended) The videoconference station of claim 1 1 wherein said 
mnnns for d e t e rmining wh e th e r naid conferee is speaking is speaker identification 
subsystem comprises a voice activity detector. 

13. (Previously Presented) The videoconference system of claim 12 wherein said 
voice activity detector includes image analysis and recognition software. 
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1 4. (Currently Amended) A method of displaying images of a plurality of conferees 
in a videoconference system, comprising: 

determining whether a conferee is speaking by analyzing a consistency between 
visual lip movements of said conferee and an audio signal from a conference station in 
which said conferee is located such that the combination of lip movement and audio 
signal indicates human speech; 

determining which of the conferees is the loudest speaker when multiple 
conferees are speaking simultaneously; and 

visually altering an image of said conferee that is displayed to other conferees 
when said conferee is determined to be sp e aking the loudest speaker . 

15-18. (Canceled) 

19. (Withdrawn) A method for identifying which conferee in a video conference is 
speaking comprising: 

detecting a first audio signal associated with a first conferee; 

detecting a second audio signal associated with a second conferee; 

comparing the first detected audio signal with the second detected audio signal to 
determine which detected audio signal is louder; and 

providing an indication to the first conferee and the second conferee of which 
detected audio signal is louder. 
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20. (Withdrawn) The method of claim 19, wherein providing an indication to the first 
conferee and the second conferee of which detected audio signal is louder comprises: 

altering an image of either the first conferee or the second conferee based, at least 
in part, on which detected audio signal is determined to be louder. 

21 . (Withdrawn) The method of claim 20, wherein altering the image of either the 
first conferee or the second conferee based, at least in part, on which detected audio 
signal is determined to be louder comprises: 

highlighting a border around the image of either the first conferee or the second 
conferee based, at least in part, on which received audio signal is determined to be louder. 

22. (Withdrawn) The method of claim 19, wherein providing an indication to the first 
conferee and the second conferee of which detected audio signal is louder comprises: 

providing textual information to the first conferee and the second conferee 
indicating which detected audio signal is determined to be louder. 

23-28. (Canceled) 

29. (Currently Amended) The videoconferencing system of claim 5, further 
comprising: 

means for visually altering an image of said conferee displayed in other 
conference stations if said conferee is determined to be sp e aking the loudest speaker of 
the plurality of conferees . 



Application No.: 09/587,990 
Atty. Docket No.: 42390.P10406C 



-5- 



Examiner: G. Eng 
Art Unit: 2643 



30. (New) A videoconferencing system comprising: 

a speaker identification subsystem located at each of a plurality of remotely 
located videoconferencing stations, the subsystem to determine whether a conferee is 
speaking based, at least in part, on whether visual lip movements of the conferee are 
reasonably consistent with an audio signal from a conference station in which the 
conferee is located, and also to determine which of the conferees is speaking the loudest 
when multiple conferees are speaking simultaneously; and 

a conference bridge coupled to each of the videoconferencing stations to visually 
alter an image of a conferee who is speaking before combining the image with images 
received from each of the other videoconferencing stations and transmitting a combined 
image back to each of the videoconferencing stations. 

3 1 . (New) A method for identifying which conferee in a video conference is speaking 
comprising: 

detecting a first audio signal associated with a first conferee; 

detecting a second audio signal associated with a second conferee; 

comparing the first detected audio signal with the second detected audio signal to 
determine which detected audio signal is louder; and 

providing an indication to the first conferee and the second conferee of which 
detected audio signal is louder. 

32. (New) The method of claim 31, wherein providing an indication to the first 
conferee and the second conferee of which detected audio signal is louder comprises: 
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altering an image of either the first conferee or the second conferee based, at least 
in part, on which detected audio signal is determined to be louder. 

33. (New) The method of claim 32, wherein altering the image of either the first 
conferee or the second conferee based, at least in part, on which detected audio signal is 
determined to be louder comprises: 

highlighting a border around the image of either the first conferee or the second 
conferee based, at least in part, on which received audio signal is determined to be louder. 

34. (New) The method of claim 31, wherein providing an indication to the first 
conferee and the second conferee of which detected audio signal is louder comprises: 

providing textual information to the first conferee and the second conferee 
indicating which detected audio signal is determined to be louder. 
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